Dealing with paralogy in RADseq data: in silico detection and single nucleotide polymorphism validation in Robinia pseudoacacia L.

Authors

  • Cindy F. Verdu
  • Erwan Guichoux
  • Samuel Quevauvillers
  • Olivier De Thier
  • Yec'han Laizet
  • Adline Delcamp
  • Frédéric Gévaudant
  • Arnaud Monty
  • Annabel J. Porté
  • Philippe Lejeune
  • Ludivine Lassois
  • Stéphanie Mariette
Abstract

RADseq technology allows researchers to efficiently develop thousands of polymorphic loci across multiple individuals with little or no prior information on the genome. However, many questions remain about the biases inherent to this technology. Notably, sequence misalignments arising from paralogy may affect the development of single nucleotide polymorphism (SNP) markers and the estimation of genetic diversity. We evaluated the impact of putative paralogous loci on genetic diversity estimation during the development of SNPs from a RADseq dataset for the nonmodel tree species Robinia pseudoacacia L. We sequenced nine genotypes and analyzed the frequency of putative paralogous RAD loci as a function of both the depth of coverage and the mismatch threshold allowed between loci. The proportion of loci showing putative paralogy varied widely, from 1% to more than 20%, with the depth of coverage having a major influence on the result. Putative paralogy artificially increased the observed degree of polymorphism and the resulting estimates of diversity. The choice of the depth-of-coverage threshold also affected diversity estimation and SNP validation: a low threshold decreased the chances of detecting minor alleles, while a high threshold increased allelic dropout. SNP validation was better for the low threshold (4×) than for the high threshold (18×) we tested. Using the strategy developed here, we were able to validate more than 80% of the SNPs tested by means of individual genotyping, resulting in a readily usable set of 330 SNPs suitable for population genetics applications.

Similar resources

Development and Evaluation of a Novel Set of EST-SSR Markers Based on Transcriptome Sequences of Black Locust (Robinia pseudoacacia L.)

Black locust (Robinia pseudoacacia L. of the family Fabaceae) is an ecologically and economically important deciduous tree. However, few genomic resources are available for this forest species, and few effective expressed sequence tag-derived simple sequence repeat (EST-SSR) markers have been developed to date. In this study, paired-end sequencing was used to sequence transcriptomes of R. pseud...

Physiological responses of Celtis caucasica L. and Robinia pseudoacacia L. to the cadmium and lead stresses

Afforestation of contaminated areas is considered as a possible strategy for reduction of contaminations. In the present study, the effects of lead (Pb) and cadmium (Cd) were investigated on chlorophyll fluorescence parameters (Fv/Fm, Fo, and Fm), photosynthetic pigments (chlorophyll a, b, and total chlorophyll), and proline in one-year-old seedlings of Celtis caucasica and Robinia pseudoacacia...

Molecular cloning of two classes of Em-like proteins from the seeds of the leguminous tree Robinia pseudoacacia.

To check for the presence of Em-like proteins in seeds of the leguminous tree Robinia pseudoacacia L. (black locust), a cDNA library constructed from mRNA isolated from developing seeds was screened using radiolabeled cDNA encoding the Em protein from Vigna radiata as a probe. Sequence analysis of the identified cDNA clones revealed two classes of Em proteins in Robinia. The nucleotide sequence...

In-silico study to identify the pathogenic single nucleotide polymorphisms in the coding region of CDKN2A gene

Background: CDKN2A, encoding two important tumor suppressor proteins p16 and p14, is a tumor suppressor gene. Mutations in this gene and subsequently the defect in p16 and p14 proteins lead to the downregulation of RB1/p53 and cancer malignancy. To identify the structural and functional effects of mutations, various powerful bioinformatics tools are available. The aim of this study is the ident...

P-196: Association rs3819392 Single Nucleotide Polymorphism within The KIT Gene with Azoospermic Male Infertility

Background: Recent studies have shown that KIT is expressed in the cytoplasm of the spermatogonia, acrosomal granules and Leydig cells. Reduced KIT expression in oligozoospermia is accompanied by an increase in germ cell apoptosis. Three single-nucleotide polymorphisms (SNPs) have been identified, and these have been studied to discover the role of KIT in male infertility. The aim of this study...

Journal title:

Volume 6, Issue

Pages -

Publication date: 2016